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ABSTRACT 

The clustering properties of a well-defined sample of 734 Ha emitters at z = 0.845 ± 
0.015, obtained as part of the Hi-z Emission Line Survey (HiZELS), are investigated. The 
spatial correlation function of these Ha emitters is very well-described by the power law 
£ = (r/rg)" 1 ' 8 , with a real-space correlation, r , of 2.7±0.3h~ 1 Mpc. The correlation length 
r increases strongly with Ha luminosity (Lh q ) , from r <~ 2 h _1 Mpc for the most quiescent 
galaxies (star-formation rates of <~ 4 M yr~ 1 ), up to ro > 5 h^ 1 Mpc for the brightest galax- 
ies in Ha. The correlation length also increases with increasing rest-frame if-band (M K ) 
luminosity, but the r -LHa correlation maintains its full statistical significance at fixed Mk- 
At z = 0.84, star-forming galaxies classified as irregulars or mergers are much more clustered 
than discs and non-mergers, respectively; however, once the samples are matched in Lh q and 
Mk, the differences vanish, implying that the clustering is independent of morphological type 
at z <~ 1 just as in the local Universe. The typical Ha emitters found at z = 0.84 reside in 
dark-matter haloes of w 10 12 M Q , but those with the highest SFRs reside in more massive 
haloes of s=y 1O 13 M . The results are compared with those of Ha surveys at different red- 
shifts: although the break of the Ha luminosity function L^ a evolves by a factor of <~ 30 
from z = 0.24 to z = 2.23, if the Ha luminosities at each redshift are scaled by L^ a (z) then 
the correlation lengths indicate that, independently of cosmic time, galaxies with the same 
(Ln a )/ Lft a (z) are found in dark matter haloes of similar masses. This not only confirms that 
the star-formation efficiency in high redshift haloes is higher than locally, but also suggests a 
fundamental connection between the strong negative evolution of L^ a since z = 2.23 and the 
quenching of star-formation in galaxies residing within dark-matter haloes significantly more 
massive than 10 12 M Q at any given epoch. 

Key words: galaxies: high-redshift, galaxies: cosmology: observations, galaxies: evolution. 



1 INTRODUCTION 

In a Universe dominated by cold dark matter, galaxies are found in 
dark matter haloes with a structure determined by universal scaling 
relations (e.g. Navarro et al. 1996). Since baryons trace the under- 
lying distribution of dark matter, measurements of the clustering 
of baryonic matter can be used to extract typical dark-matter halo 
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masses (Mo & White 1996; Sheth et al. 2001) and to suggest links 
between populations found at different epochs. 

Wide surveys of the nearby Universe (e.g. 2dF, SDSS; Col- 
less et al. 2001; York et al. 2000) have now assembled extremely 
large samples of galaxies which can be explored to study their clus- 
tering properties in great detail. Using those data, several studies 
have found that the amplitude of the two-point correlation func- 
tion rises continuously with galaxy luminosity (e.g. Norberg et al. 
2001; Zehavi et al. 2005; Li et al. 2006). They also reveal that the 
most rapid increase occurs above the characteristic luminosity, L* . 
Red galaxies are found to be more clustered than blue galaxies, 
but the correlation amplitude of the latter population also increases 
continuously with blue or near-infrared luminosities (e.g. Zehavi 
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et al. 2005). This seems to indicate that both star-formation rate 
(SFR, traced by ultraviolet/blue light) and stellar mass (traced by 
near-infrared light) are important for determining the clustering of 
different populations of galaxies. Other studies have used morpho- 
logical classifications. Skibba et al. (2009), for example, took ad- 
vantage of the largest sample of visually classified morphologies to 
date (from SDSS) to clearly reveal that although early-type galax- 
ies cluster more strongly than discs, once the analysis is done at a 
fixed colour and luminosity no significant difference is found; this 
confirms previous results (e.g. Beisbart & Kerscher 2000; Ball et al. 
2008) for the nearby Universe. 

Understanding when these trends were created and how they 
evolved with cosmic time is a key input to galaxy formation mod- 
els and to our general understanding of how galaxies formed and 
evolved. The first clustering studies of different populations of 
galaxies beyond the local Universe (e.g. Efstathiou et al. 1991; 
Fisher et al. 1994; Brainerd et al. 1995; Le Fevre et al. 1996), 
although pioneering, were mostly limited by the lack of informa- 
tion on the redshifts for their samples. Fortunately, those problems 
are now starting to be effectively tackled by larger and deeper sur- 
veys. Such surveys have recently led to robust clustering studies of 
specific populations of sources at moderate and high redshift such 
as AGN (e.g. da Angela et al. 2008), sub-mm galaxies (e.g. Blain 
et al. 2004; WeiB et al. 2009), luminous red and massive galaxies 
(e.g. Wake et al. 2008), or star-forming galaxies such as Ha emit- 
ters, Lyman-break galaxies or Lyman-a emitters (e.g. Geach et al. 
2008; McLure et al. 2009; Shioya et al. 2009). A dependence of 
the clustering on galaxy luminosity has also been identified beyond 
the local Universe (e.g. Giavalisco & Dickinson 2001). In partic- 
ular, Kong et al. (2006), Hayashi et al. (2007) and, more recently, 
Hartley et al. (2008) found a clear dependence of the clustering am- 
plitude on near-infrared luminosity for z ~ 1 — 2 "BzK" selected 
galaxies, indicating that already in the young Universe the most 
massive galaxies were much more clustered than the least massive 
ones. These results mean that although different studies have been 
combined to suggest links between the same population at different 
redshifts, or between different populations at different epochs, in- 
terpreting these requires a great deal of care, since luminosity lim- 
its typically increase significantly with redshift and are different for 
different populations. 

Narrow-band Ha surveys are now a very effective way to ob- 
tain representative samples of star-forming galaxies. Since the Gal- 
lego et al. (1995) wide-area study in the local Universe, tremen- 
dous progress has been achieved, with recent narrow-band studies 
(e.g. Ly et al. 2007; Shioya et al. 2008; Geach et al. 2008; Shim 
et al. 2009; Sobral et al. 2009a) taking advantage of state-of-the-art 
wide-field cameras and obtaining large samples of typically hun- 
dreds of Ha emitters from z = to z = 2.23, down to limit- 
ing star-formation rates (SFRs) of « 1-10 Moyr -1 . In particular, 
HiZELS, the Hi-z Emission Line Survey (c.f. Geach et al. 2008, 
Sobral et al. 2009a; hereafter G08 and S09, respectively), is obtain- 
ing and exploring unique samples of star-forming galaxies present- 
ing Ha emission (and other major emission lines, e.g. Sobral et al. 
2009b), redshifted into the J, H and K bands. By using a set of 
existing and custom-made narrow-band filters, HiZELS is survey- 
ing Ha emitters at z = 0.84, z — 1.47 and z = 2.23 over several 
square degree areas of extragalactic sky. 

Narrow-band surveys have a great potential for determining 
the clustering properties of large samples of galaxies and their evo- 
lution with cosmic time. Such surveys probe remarkably thin red- 
shift slices (Az « 0.02), which not only provides an undeniable 
advantage over photometric surveys that can be significantly af- 



fected by systematic uncertainties, but also allows the study of very 
well-defined cosmic epochs. Additionally, the selection function is 
well-understood and easy to model in detail, a feature that con- 
trasts deeply with that of current large high-redshift spectroscopic 
surveys. Narrow-band surveys also populate the redshift slices they 
probe with high completeness down to a known flux limit, in con- 
trast to spectroscopic surveys which are usually very incomplete at 
any single redshift and present the typical pencil-beam distribution 
problems. Finally, narrow-band surveys can select equivalent pop- 
ulations at different redshifts, making it possible to really under- 
stand the evolution with cosmic time, avoiding the biases arising 
from comparing potentially different populations. 

This paper presents a detailed clustering study of the largest 
sample of narrow-band selected Ha emitters at z ~ 1 (c.f. S09), 
obtained after conducting deep narrow-band imaging in the J band 
over ~ 1.3 deg 2 , as part of the HiZELS survey. It is organised in 
the following way. §2 outlines the data, the samples and their on- 
sky distribution. §3 presents the angular two-point correlation func- 
tion, carefully estimating the errors and other potential bias, along 
with the exact calculation of the real-space correlation. §4 quan- 
tifies the clear clustering dependences on the host galaxy proper- 
ties. §5 presents the first detailed comparison between the cluster- 
ing properties of Ha emitters from z = 0.24 up to z = 2.23. Fi- 
nally, §6 presents the conclusions. An Ho = lOOh kms -1 Mpc -1 , 

= 0.3 and £7a = 0.7 cosmology is used and, except where 
otherwise noted, magnitudes are presented in the Vega system and 
h = 0.7. 



2 DATA AND SAMPLES 

2.1 The sample of Ha emitters from HiZELS 

This paper takes advantage of the HiZELS large sample of Ha 
emitters at z — 0.845 presented in S09 (the reader is referred to that 
paper for full details of how the sample was constructed). Briefly, 
the sample was obtained from data taken through a narrow-band 
J filter (A off = 1.211 ± 0.015nm) using the Wide Field CAMera 
(WFCAM) on the United Kingdom Infrared Telescope (UKIRT), 
and reaches an effective flux limit of 8 x 10~ 17 erg s _1 cm -2 over 
~ 0.7 deg 2 in the UKIDSS Ultra Deep Survey (UDS, Lawrence 
et al. 2007) field, and ~ 0.8 deg 2 in the Cosmological Evolution 
Survey (COSMOS, Scoville et al. 2007) field. The data allowed the 
selection of a total of 1517 line emitters which are clearly detected 
in NBj (signal-to-noise ratio > 3) with a J-NB,j colour excess sig- 
nificance of E > 2.5 and observed equivalent width EW> 50 A. 

As the detected emission may originate from several possible 
emission lines at different redshifts, photometric redshifts 1 were 
used to distinguish between them and select the complete sample 
of 743 Ha emitters over a co-moving volume of 1.8 x 10 5 Mpc' 5 
at z = 0.84. Of these, 477 are found in COSMOS (0.76 deg 2 ) and 
266 in UDS (0.54 deg 2 ; the reduction in area is driven by the over- 
lap with the high-quality photometric catalogue used - c.f. S09 for 
more details)). The completeness and reliability of this sample were 
studied using the ~ 10 4 available redshifts from zCOSMOS Data 
Release 2 (Lilly et al. 2009), spectroscopically confirming ~ 100 
Ha emitters within the photometric redshift selected sample; this 

1 Photometric redshifts used in S09 for COSMOS present a(Az) = 0.03, 
where Az = {zp^t — z spe c)/(l + z spec ); the fraction of outliers, defined 
as sources with Az > 3<r(Az), is lower than 3per cent, while for UDS, the 
photometric redshifts have a(Az) = 0.04, with 2 per cent of outliers. 
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Figure 1. The on-sky distribution of the Ha emitters found at z = 0.84 in the UDS (left panel) and COSMOS (right panel) fields, together with a photometric 
redshift selected sample at the same redshift. Both panels/boxes cover the same angular/physical area, corresponding to ss 29 X 29Mpc at z = 0.84. The Ha 
emitters are plotted in 3 different symbols corresponding to different star-formation rates (see key in left panel). 



allowed to estimate a > 95 per cent reliability and > 96 per cent 
completeness for the sample in COSMOS. Ha fluxes, Ha lumi- 
nosities and Ha-based star-formation rates were obtained for all 
Ha candidates using HiZELS data - the details on how these were 
computed are fully detailed in S09. 

Since the work presented in S09, improved photometric red- 
shifts for COSMOS and UDS have been produced, incorporating 
a higher number of bands, deeper data and accounting for possible 
emission-line flux contamination of the broad-bands (c.f. Cirasuolo 
et al. 2009; Ilbert et al. 2009, Cirasuolo et al. in prep.); these further 
confirm the robustness and completeness of the selection done in 
S09. Nevertheless, the Ha sample used for the analysis in this paper 
is slightly modified from that in S09 on the basis of the new pho- 
tometric redshifts. In particular, the sample in UDS now contains 
257 Ha emitters over 0.52 deg 2 . The very small reduction in area 
simply results from the overlap with deep mid-infrared data used 
for deriving the new improved photometric redshifts - this reduc- 
tion places 3 sources from the initial sample outside the coverage. 
The remaining 6 sources excluded from the S09 sample were re- 
moved on the basis that the new photometric redshifts clearly place 
those sources at z ~ 2.2 (2) and z w 1.45 (4) (they are therefore 
identified as candidate [Oil] 3727 and [Om] 5007/H/? emitters, re- 
spectively). For COSMOS, the sample of Ha emitters is the same 
as in S09 (477 emitters over 0.76 deg 2 ), since the new photometric 
redshifts do not change any of the classifications (noting that spec- 
troscopic data had already been used to produce a cleaner sample 
in S09). 

As the samples were obtained in two of the best studied 
square degree areas, a wealth of multi-wavelength data are avail- 
able. These include deep broad-band imaging from the ultra-violet 
(UV) to the infrared (IR) - including deep Spitzer data. These 
data make it possible to compute rest-frame luminosities for the 
sample of Ha emitters at z = 0.84. Here, rest-frame B luminosi- 
ties (Mb) are estimated by using i + -band data (probing 4152.6 A 
at z = 0.84) obtained with the SUBARU telescope in both UDS and 
COSMOS. This is obtained by applying an aperture correction of 
-0.097 mag (to recover the total flux for magnitudes measured in 3 
arcsec apertures) and a galactic extinction correction of -0.037 to 
i + magnitudes (Capak et al. 2007). It is assumed that all sources 
are at a luminosity distance of 5367 Mpc (z = 0.845); this yields 



a distance modulus of 43.649. Rest-frame K luminosities (Mk) 
are computed following Cirasuolo et al. (2009) by assuming the 
same luminosity distance and interpolating using deep 3.6 (im and 
4.5 pim Spitzer data. 

Furthermore, the COSMOS field has been imaged by 
ACS/HST, and the sample of Ha emitters has been morphologically 
classified both automatically, with ZEST (Scarlata et al. 2007), and 
visually, as detailed in S09. The galaxies were classified into early- 
types, discs and irregulars and, independently (visually-only), into 
non-mergers, potential mergers and mergers (c.f. S09). The sample 
has also been investigated for AGN contamination both in S09, us- 
ing emission-line diagnostics, and in Garn et al. (2009) making use 
of a wide variety of techniques. 

Figure 1 presents the distribution of the Ha emitters at z — 
0.84 for the UDS and COSMOS fields. The panels visually indi- 
cate how these emitters cluster across these two regions of the sky 
relative to a simple photometric redshift selected population at the 
same redshift, selected with 0.82 < z p hoto < 0.87. 



2.2 The random sample 

Random catalogues are essential for robust clustering analysis and 
an over- or under-estimation of the clustering amplitude can eas- 
ily be obtained if one fails to produce accurate random catalogues. 
These are produced by generating samples with 100 times more 
sources than the real data and by distributing those galaxies ran- 
domly over the geometry corresponding to the survey's field-of- 
view. This takes into account both the geometry of the fields and 
the removal of masked areas (due to bright stars/artefacts) - see 
discussion of masked regions in S09 for more details. 

While the survey is fairly homogeneous in depth, there are 
some small variations from chip to chip and field to field reaching 
a maximum of ~ 0.2 mag (NB,/ of 21.5 to 21.7 mag). This corre- 
sponds to a maximum variation in luminosity [in log(L)] of ~ 0.1. 
The observed luminosity function presented in S09 shows that go- 
ing deeper by 0. 1 in log(L) increases the number count by ~ 20 per 
cent. This can have an effect on estimating the clustering, although 
simulations indicate that this is smaller than the measured errors. 
The final random catalogues were created reproducing source den- 
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sities varying in accordance with the measured depths and the S09 
number counts. 



3 THE CLUSTERING PROPERTIES OF Ha EMITTERS 

3.1 The two-point cu(8) correlation function at z = 0.84 

In order to evaluate the two-point angular correlation function, the 
minimum variance estimator suggested by Landy & Szalay (1993) 
is used: 

where DD(0) is the number of pairs of real data galaxies within 
(8, 8 + 68), DR(#) is the number of data-random pairs and RR(0) 
is the number of random-random pairs. Nr and Nd are the number 
of random and data galaxies in the survey. Errors are computed 
using the Poisson estimate (Landy & Szalay 1993): 



Aw(0) = 



y/T>D(0) ' 



(2) 



The angular correlation function is computed for the entire 
sample of Ha emitters 2 , as well as for COSMOS and UDS sepa- 
rately, and it is found to be very well-fitted by a power-law with 
the form A8 13 with 8 in arcsec. The power-law fit is obtained by 
determining u(6) 2000 times using different random samples and 
a range of bin widths (A log 8 = 0.1 — 0.3, randomly picked for 
each determination) and by performing a x fit to eacn °f these 
(over 5 < 8 < 600 arcsec, corresponding to 38.2 kpc to 4.5 Mpc 
at z — 0.845; these correspond to angular separations for which 
fitting oj(8) with one single power-law is appropriate; see details in 
Section 3.2.2). The results are shown in Figure 2 and presented in 
Table 1; they imply A = 14.1 ± 3.9 and /? = -0.79 ± 0.06 for 
the entire sample (the errors present the la deviation from the av- 
erage). This reveals a very good agreement with the fiducial value, 
P = -0.8. 

Recent studies (e.g. Ouchi et al. 2005) have found a transition 
between small and large-scale clustering (corresponding to one- 
and two-halo clustering, respectively) at high redshift, manifested 
as a clear deviation from the power-law at small scales (smaller 
than « 50 kpc for Lyman-break galaxies at z ~ 4, for example). 
While there is no definitive evidence for the one-halo term for the 
Ha sample presented here (especially for separations larger than 
~ 5" - corresponding to 38 kpc), for smaller separations the results 
point towards a departure from the power-law, but only at « la- 
level. Therefore, a larger sample is required to reliably determine 
oj(8) down to the smallest scales and constrain the contribution of 
the one-halo term. 

The best power-law fit seen in Figure 2 also reveals a good 
agreement both at small and larger scales between the COSMOS 
and UDS fields (see also Table 1); the UDS field presents a slightly 
higher clustering amplitude, but consistent within la (fixing P — 
—0.8). Note that the Ha luminosity function derived in S09 for 
each field also revealed a good agreement between the two fields. 
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Figure 2. The two-point angular correlation function for the entire sample 
and for the COSMOS and UDS fields separately for angular separations up 
to 600 arcsec, and the la, 2a and 3a co- variance contours when fitting both 
A and for the entire sample compared to the results of each field. These 
results show a very good agreement between COSMOS and UDS, reveal- 
ing that Ha emitters in completely different regions of the sky cluster with 
comparable amplitudes and power-law slopes. The results also show that 
there is no clear departure from the power-law for this sample of Ha emit- 
ters down to 5", implying that the 2-halo contribution to u>(9) is dominant 
for scales larger than ~50 kpc, but there is tentative evidence of a significant 
1-halo contribution to ui(9) for smaller scales. 



Table 1. The power-law fit parameters which best describe ui(6) for Ha 
emitters at z = 0.84, resulting from computing ui(6) 2000 times. These 
were obtained over 5 < 9 < 600 arcsec, corresponding to 38 kpc to 
4.5 Mpc at z = 0.84, avoiding both the possible one-halo clustering contri- 
bution (at the smallest scales) and the break of the Limber's approximation 
(at the largest scales). Note the degeneracy between A and in Figure 2. 
A^ = _o.8 is obtained by fixing /3 = —0.8, the fiducial value and in excel- 
lent agreement with what has been found for the entire sample with or with- 
out possible/likely AGN contamination. An error of 20 per cent is added in 
quadrature to random error in AAp = _g g to account for cosmic variance 
for the entire sample (or 25 per cent when considering just one sub-field - 
COSMOS or UDS, c.f. Section 3.1.1). 



Sample 


Number 


A 


P 


As=-0.8 


(z = 0.84) 


# 


(9 in arcsec) 




(9 in arcsec) 


All Emitters 


734 


14.1 ± 3.9 


-0.79 ±0.06 


14.2 ±3.1 


No AGN 


660 


13.6 ±4.4 


-0.78 ±0.07 


14.1 ± 3.0 


No likely AGN 


700 


15.4 ±4.0 


-0.81 ± 0.06 


14.6 ±3.2 


COSMOS 


477 


11.1 ± 3.0 


-0.75 ±0.08 


13.8 ±3.7 


UDS 


257 


19.2 ±8.9 


-0.84 ± 0.15 


18.4 ±5.5 



These results are consistent with ~ 0.6 — 0.8 deg fields being 
sufficient to overcome most of the effects of cosmic variance when 
conducting clustering analysis with narrow-band surveys at z ~ 1 
- although the agreement could be caused by chance. 



2 Due to the finite area probed, the measured clustering amplitude will be 
underestimated by an amount C (known as the integral constrain; c.f. Roche 
et al. 2002) which depends on the assumed true power-law and the probed 
area. This is estimated as C = 0.0023 for the entire survey area and it only 
represents Rj 0.01 per cent change in A, but it is still included, since this 
becomes significant for obtaining u(9) at large separations; c.f. Figure 4. 



3.1.1 Robust error estimation and the effect of cosmic variance 

The survey covers 1.3 deg 2 in two completely independent fields; 
this is a fundamental advantage over smaller surveys which only 
probe one single field, as it allows, in principle, a reliable estimate 
of possible errors due to cosmic variance. In order to achieve this, 



The clustering and evolution of Ha emitters at z ~ 1 from HiZELS 5 



80 



< 5 ° 

'« 30 
■ 

CD 
O 
£= 

=> 



20 



• 1 00 to 1 000 measurements 

• COSMOS-UDS difference 

• Bootstrap error 

O Extrapolated (full area) 



oo 



0.05 



0.1 



0.2 0.4 

Area(deg 2 ) 



0.8 



1.6 



Figure 3. The uncertainty in A (per cent; A A = a/A X 100, with 
j3 = —0.8) derived from 100-1000 u>{6) measurements on sub-samples 
spanning different areas within the complete 1.3 deg 2 (up to 0.5 deg 2 ) - 
which is considered to be mostly due to cosmic variance. This reveals that 
AA decreases with increasing area and this trend is very well fitted by a 
simple power-law (20 X per cent, with in deg 2 , shown by the 

dashed line). The simple standard deviation between results from the COS- 
MOS and UDS fields is also shown for comparison. The error estimated 
from the bootstrap analysis and that given by extrapolating the fitted power- 
law are also shown for the full area. 



ui(9) is estimated over randomly picked square areas of different 
sizes (from 0.05 deg 2 , corresponding to the area probed by one 
WFCAM chip, up to 0.5 deg 2 ) in COSMOS and UDS. The mini- 
mum number of emitters ranges from 15 to 65 within the smaller 
areas used (0.05 deg 2 ). Either 100 and 1000 randomly chosen re- 
gions are considered for each area (100 for 0.3-0.5 deg 2 and 1000 
for smaller areas) and a power law is fitted to each uj(6) determi- 
nation by fixing = —0.8. For each area, the standard deviation 
on the amplitude A is used to quantify the uncertainty. The results 
are shown in Figure 3 and demonstrate that the measured standard 
deviation is effectively reduced with area. The error in A (per cent) 
can be approximated by a power law of the form 20 x 0~ - 35 with 
8 in deg 2 ; extrapolating this suggests an error slightly lower than 
~ 20 per cent for the total area of the survey, and also agrees with 
the small difference found between COSMOS and UDS (see Figure 
3). Furthermore, this suggests that one requires areas of ~ 7 deg 2 
(the HiZELS survey target) or more to reduce the effects of cosmic 
variance to less than 10 per cent. 

These estimates could still slightly underestimate the errors, 
mostly because at larger areas the number of approximately inde- 
pendent areas is strongly reduced. An alternative estimate of the 
clustering uncertainty can be derived using a bootstrap analysis. 
Since this often leads to a slight over-estimation of the errors, it 
can be combined with the previous analysis to give an approximate 
range for the expected error. The analysis is done by naturally di- 
viding the complete sample into 26 regions of ~0.05 deg each 
(these are the minimum individual probed areas as these are cov- 
ered by one WFCAM chip) and computing oj(6) with 26 randomly 
picked regions each time for 10000 times. Fitting all realizations 
with a power law (fixing j3 = —0.8) results in a distribution with 
a standard deviation of 23 per cent in the clustering amplitude, A. 
When compared and combined with the previous estimation (w 18 
per cent error), it suggests an error of ~ 20 per cent in the clus- 



tering amplitude; this will be added in quadrature to the directly 
calculated errors in A. 



3.1.2 AGN contamination 

Some of the Ha emitters are likely to contain an AGN. By us- 
ing emission-line ratio diagnostics for a small subset of emitters in 
zCOSMOS, S09 estimated a contamination of ~15 per cent AGN 
in the sample. More recently, Garn et al. (2009) performed an ex- 
tended search for AGN within the sample using several methods for 
identifying those sources, such as radio, X-rays and mid-infrared 
colours. This resulted in the identification of a maximum of 74 
AGN (40 in COSMOS and 34 in UDS). From these, 34 are clas- 
sified as likely AGN and the remaining 40 as possible AGN - this 
corresponds to an estimated ~ 5-11 per cent AGN contamination 
within the sample. 

The AGN may well have different clustering properties from 
the star-forming population. However, the nature of these potential 
AGN contaminants is very unclear, particularly the origin of the 
detected Ha emission. For example, they are mostly morphologi- 
cally classified as irregulars and mergers and they span the entire 
luminosity range; it thus seems that although some of them might 
have their Ha emission powered by the AGN, they may well be 
undergoing significant star-formation. Indeed, recently, Shi et al. 
(2009) presented a study of unambiguous AGN at z ~ 1, showing 
that at least half of the sample shows clear signatures of intense 
star-formation. 

In order to understand how AGN contaminants might affect 
the clustering measurements, the angular correlation function was 
calculated after removing all the potential AGN contaminants. This 
results in no statistical change in either A or j3 for the entire sample 
(see Table 1). Removing only the likely AGN leads to an identical 
result. However, the true AGN contaminants (and the possible bias 
they may introduce) might well have a larger effect on the results 
when studying the clustering as a function of host galaxy properties 
(presented in Section 4), since for certain host galaxy properties the 
AGN contamination may be significantly higher than in the sample 
as a whole. 

To ensure that the analysis is really tackling the AGN contam- 
ination and to robustly determine the clustering properties of each 
sample being studied (and test trends), all the clustering proper- 
ties in this paper (for the z = 0.84 sample and sub-samples) are 
derived from a combination of measurements from samples with 
and without possible AGN contaminants. In practice, oj(6) is com- 
puted 500 times each using i) all the emitters in the sample, ii) 
removing likely AGN, and iii) removing all (likely plus possible) 
AGN. x 2 fits are obtained for each realization of each sample and 
the total resulting distribution is used for the analysis. This also 
allows to carefully confirm that there is a good degree of consis- 
tency between the 3 different distributions and in no case is the 
difference between samples with and without AGN larger than the 
lcr errors. This mostly results in estimating larger errors than those 
which would be obtained from either simply excluding all AGN 
or not dealing with AGN contamination, and therefore represents a 
conservative approach. 



3.2 Real-space correlation 

3.2.1 First order approximation: Limber 's equation 

The real-space correlation, r , is a very useful description of the 
physical clustering of galaxies when the spatial correlation func- 
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tion is well-described by £ = (r/ro) 7 . The inverse Limber trans- 
formation (Peebles 1980) can be easily used to obtain an approx- 
imation of the spatial correlation function 3 from the angular cor- 
relation function, provided the redshift distribution is known. For 
narrow-band surveys, the expected redshift distribution depends 
solely on the shape of the narrow-band filter profile; this can be 
well approximated by a Gaussian 4 , which, for the Ha emission- 
line, corresponds to a redshift distribution centered at z = 0.845 
with a — 0.0075. This can be compared with the redshift distribu- 
tion from Ha emitters confirmed by zCOSMOS (93 sources) which 
has a mean redshift of 0.844 and a = 0.0076; the excellent agree- 
ment reveals that the real redshift distribution should be very close 
to the one assumed. 

In order to calculate the real space correlation length, ro, and 
when performing the de-projection analysis, it is assumed that the 
redshifts are drawn from this Gaussian distribution and that the 
real-space correlation function is independent of redshift (c.f. G08; 
Kovac et al. 2007) over the redshift range probed; this is a very good 
approximation, since the redshift distribution is extremely narrow. 
Contamination by sources which are not Ha emitters at z = 0.84 
could have a significant effect on ro, since the real redshift distribu- 
tion will be different from the one assumed. Nevertheless, in S09 
the sample was studied to find that only 2 out of 90 sources with 
spectroscopic redshifts that had been photometrically selected as 
Ha emitters were not real Ha emitters (one [Sll]6717 emitter at 
z = 0.79 and a [Olll] 5007 emitter at z = 1.42, which were then 
removed from the sample). This conclusion is drawn from the anal- 
ysis of limited spectroscopic data (~ 20 per cent of the HiZELS 
sample in COSMOS), but contamination at that level will only lead 
to underestimating ro by a maximum of 6 per cent 5 . Throughout 
this paper, a 5 per cent correction is applied when computing ro to 
account for the expected contamination. 

Computing ro for each realization of u)(6) fitted with a power- 
law with /3 = -0.8 results in r = 2.5 ± 0.3 IT 1 Mpc for the 
entire sample, or ro = 2.6 ± 0.3 h _1 Mpc when accounting for 5 
per cent contamination. Note that the 20 per cent error in A due 
to cosmic variance results in an error of 1 1 per cent in ro, and this 
is added in quadrature. However, as mentioned before, Limber's 
equation is only an approximation which can potentially result in 
significant errors for narrow-band surveys at high redshift. Section 
3.2.2 describes how ro is robustly calculated by fully de-projecting 

<•>(*)■ 

3.2.2 Accurate determination of ro for narrow-band surveys 

The errors introduced by the Limber's approximation are tack- 
led by numerically integrating the exact equation connecting the 
spatial and angular correlation functions (following Simon 2007). 
This relation implies that spatial correlation functions described by 
£ = (r/r ) 7 are projected as angular correlation functions with 

3 Limber's equation is an approximation that breaks down for large angular 
separations when one uses a narrow filter, but can still be used to obtain at 
least a first approximation of ro within « 15 per cent for the NBj filter 
used; see Section 3.2.2 for details. 

4 A simpler way to model the narrow-band filter is to assume it is a top- 
hat; this results in a Ha redshift distribution of 0.845 ± 0.015. The two 
approaches produce results for different ro determinations which are con- 
sistent within 5 per cent. 

5 Assuming that contaminants will not cluster significantly, the contamina- 
tion fraction of / will result in a maximum underestimation of A given by 
(l-/) 2 , and an underestimation of r given by ~ (l-/) 2 /' 7 ' . 
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Figure 4. The two-point angular correlation function for the entire sam- 
ple, compared with the simple power-law fit expected from the Limber's 
approximation. These are compared with the best (from a x 2 fit - shown 
in the figure) exact angular correlation function obtained with the narrow- 
band filter profile for a spatial correlation function given by the power-law 
£ = (r/r ) 7 , with 7 = -1.8 and r = 2.61 ± 0.13b" 1 Mpc (not cor- 
rected for contamination; errors directly from x 2 ), revealing an excellent 
agreement with the data. This also reveals the regime for which the Limber's 
approximation breaks for this particular case (~ 600 arcsec for a ~ 15 per 
cent difference; c.f. Simon 2007 for a general analysis of the typical sepa- 
ration at which Limber's approximation breaks down). 



slopes f3 — 7 + 1 for small scales and p = 7 for large angular sep- 
arations when using a narrow filter - see full discussion in Simon 
(2007). 

Here, it is assumed that the spatial correlation function is given 
by the power-law £ = (r/r ) 7 (as this is able to reproduce the 
observed cj(9) very well with 7 = —1.8), and that the narrow-band 
filter profile is described by a Gaussian as detailed in Section 3.2.1. 
The following is then numerically integrated for the same angular 
separations as for the data and a \ 2 fit is done for u(6) around the 
value of ro obtained in the previous Section: 

a ( g\ /-i f + °° r 2f s (s-A)f s (s + A) JDJ 
/W(*)=* J o R _ lr(]A dR ds , (3) 

where tp = 1 + cos0, (f> = 1 - cos (9, A = ^(R 2 - 2s 2 (j>)/(2ip) 
and fs is the profile of the filter being used in co-moving distance 
(assumed to be a Gaussian distributed value with an average of 
2036.3b' 1 Mpc and a = 14.0h _1 Mpc). The \ 2 fitting results 
in a much more robust estimate of ro and allows the use of uj{9) up 
to larger angular separations than 600 arcsec, a regime for which a 
single power-law starts to become inadequate (as P changes from 
7 + 1 to 7, c.f. Figure 4). For the entire sample, however, this 
leads to little change: ro = 2.7 ± 0.3 h -1 Mpc (this includes a 5 
per cent correction for contamination, while the errors also include 
cosmic variance - this error estimation has been discussed in Sec- 
tion 3.1.1). Indeed, whilst Limber's equation breaks down for large 
galaxy separations, it is shown to do well as long as only smaller 
angular distances are considered. Note that real-space correlations 
for different samples (see Table 2) are computed in the following 
sections using the same procedure described here. 
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3.2.3 The dependence of the redshift distribution on limiting line 
luminosity for narrow-band surveys 

Since the narrow-band filter profile is not a perfect top-hat, emitters 
with different Ha luminosities are detected over a slightly different 
range of redshifts. As the exact relation between £(r) and oj(9) 
depends on the redshift distribution (given by the filter profile), as- 
suming the same redshift distribution for different luminosity limits 
can lead to biases, especially when looking at a possible relation- 
ship between the clustering amplitude and Ha luminosity. 

This potential bias is studied by performing the same sim- 
ulations described in S09. Briefly, the Ha luminosity function 
presented in S09 is used to generate a fake population of emit- 
ters equally distributed over a wider range of redshifts than those 
probed by the narrow-band filter and this is used to look at the re- 
covered redshift distribution of sources as a function of measured 
luminosity. The redshift distribution is found to become continu- 
ously narrower with increasing Ha luminosity limit 6 , although all 
the distributions can be equally well-fitted by a Gaussian. The vari- 
ation can be written as a function of observed Ha flux as: 

a = -r) x (log F Ha (i im it) - log F ) + o (h (4) 

where Fo is a flux limit at which the redshift distribution is rela- 
tively well understood and ct is the width of the redshift distribu- 
tion at that flux limit; for the NBj filter, r\ = 0.00117, as derived 
from simulations. 

Over the luminosity range of the sample (and for the assumed 
luminosity function and the real filter profile), neglecting this effect 
results in over-estimating ro by a maximum of 8 per cent; this is 
only found when computing ro for a sample containing the brightest 
5 per cent emitters (the £s 40 brightest emitters, for which a w 
0.0065). Nevertheless, narrow-band surveys which span a much 
wider luminosity range will be much more sensitive to this and 
line-luminosity trends could naturally arise from this bias. 

This luminosity dependence is fully taken into account when 
computing ro for sub-samples defined by different Ha luminos- 
ity/SFR limits. For the analysis of other sub-samples, if the Ha 
luminosity distribution does not present any clear off-set from that 
of the entire survey, it is assumed that the redshift distribution of 
those sub-samples is very well approximated by the complete red- 
shift distribution. 



4 THE CLUSTERING DEPENDENCES ON HOST 
GALAXY PROPERTIES 

As discussed in the Introduction, in the local Universe the cluster- 
ing has been shown to depend on several host galaxy properties. At 
z = 0.24, Shioya et al. (2008) have shown that the clustering of 
Ha emitters seems to be stronger for the most luminous sources in 
Ha, but so far testing and quantifying that clustering dependence 
at higher redshifts for Ha emitters has not been possible, as sur- 
veys have lacked area and sample size. The HiZELS sample is large 
enough to achieve this. 

The complete sample is divided into several sub-samples in 



6 Although intrinsically more luminous sources are detectable over a wider 
redshift range, their detection in the wings of the filter leads to an under- 
estimation of their luminosity; galaxies are only measured to be luminous 
in Ho when the emission line is being detected near the peak of the filter 
profile. 
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Figure 5. The two-point angular correlation function for sub-samples ob- 
tained by splitting the sample in 2 halves based on Ha luminosity/SFR. 
These clearly show that the clustering amplitude is higher for galaxies with 
higher Ha luminosities, implying that the most active star-forming galaxies 
are more clustered than the least active. Lines show the best \ 2 to the exact 
relation between £(r) andtu(9) by fixing 7 = —1.8; which (after applying 
all corrections already detailed) result in ro = 4.84 ± 0.58 h Mpc for 
the brightest emitters and 2.52 ± 0.41 h _1 Mpc for the faintest half (in Ha 
luminosity) of the sample. 

order to investigate the clustering as a function of fundamental ob- 
servable host galaxy properties. These include i) Ha luminosity 
corrected for extinction as in S09 (1 mag), ii) rest- frame K lumi- 
nosity (Mk), iii) rest-frame B luminosity (Ms) and iv) morpho- 
logical class (discs, irregulars; non-mergers, mergers). The ui(0) 
correlation function is computed for each sub-sample (a single ex- 
ample is shown in Figure 5, presenting u){6) for the brightest and 
faintest halves of the sample with respect to Ha luminosity). The 
approach detailed in the previous sections is then used by firstly 
computing ro using Limber's approximation power-law fit to u)(6) 
(fixing (3 = —0.8 and restricting the analysis to 5 < 8 < 600 
arcsec), and then using the exact relation between the spatial and 
angular correlation functions to improve the ro estimation. This 
analysis also accounts for the potential AGN contamination in the 
sample, following the procedures described at the end of Section 
3.1.2 (computing u){6) with and without the possible AGN contam- 
inants and using the combined distribution). The results obtained 
when splitting the sample into different sub-samples are presented 
in Table 2, Figure 6 and Figure 7. For sub-samples obtained from 
the complete survey area, an error of 11 per cent in ro is added 
in quadrature to the 1 o errors (see Section 3.1.1), while for sub- 
samples derived based on morphology a 14 per cent error is added 
in quadrature to Ar (for the COSMOS field only). 

4.1 Ha luminosity/ star-formation rate 

Following previous studies, such as Shioya et al. (2008), one can 
do a simple splitting of the sample in two halves with the same 
number of emitters (367 with 41.95 < log Lhc« < 43.18 ergs -1 
and equal number with 41.61 < log Lu a < 41.95 ergs -1 ), and 
study the clustering properties of each of those samples. Figure 5 
presents the angular correlation function obtained for the bright- 
est and faintest halves of the sample, revealing that the bright- 
est Ha emitters are much more clustered than the faintest ones. 
This is also very clear when one compares the spatial correla- 
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Figure 6. The dependence of the clustering length, ro , with Ha luminosity 
at z = 0.84. This clearly reveals that galaxies presenting higher Ha lumi- 
nosities/SFRs are more clustered, and that galaxies above the break in the 
Ha luminosity function (L^ a ) are much more clustered than those below. 
This dependence is not a result of the correlation between SFR (Ha lumi- 
nosity) and stellar mass (M K , see Figure 8), as r correlates as well with 
Ha luminosity for sub-samples with the same Mk ■ 

tions obtained for each sample: while the brightest emitters in Ha 
present r = 4.8 ± 0.6 h -1 Mpc, for the faintest emitters one finds 
2.5 ± 0.4 h -1 Mpc. It should also be noted that because 7 is fixed, 
the significant difference in ro for the two samples is not a result of 
the degeneracy between 7 and ro (see Figure 2 for a similar degen- 
eracy between /3 and A), contrarily to studies such as Shioya et al. 
(2008), which allow for both 7 and ro to vary; such note is valid 
throughout this paper. 

With the large sample of Ha emitters obtained by HiZELS at 
z — 0.84, it is possible to perform a much more detailed investiga- 
tion into the relation between ro and Ha luminosity by separating 
the emitters into a larger number of sub-samples. Figure 6 and Ta- 
ble 2 show that ro increases by a factor of almost 3 from the faintest 
Ha galaxies to the most active star-forming galaxies found above 
the break in the Ha luminosity function, L^ a (10 42 ' 2 erg s -1 , S09). 

Whilst the increase of ro with Lh q can be reasonably well- 
described by a straight-line fit ( \ 2 = 1-4 ), there are hints that there 
might be a stronger increase in ro around L^ a . This provides a de- 
scription which is in line with previous studies, but gives an un- 
precedented degree of detail of the relation between ro and Ha 
luminosity/star-formation rate. Moreover, the robustness of these 
results is also increased by including the small correction for the 
fact that samples with different luminosity limits will present a dif- 
ferent redshift distribution just from considering the filter profile. 

Recently, Orsi et al. (2009) made predictions for the cluster- 
ing properties of Ha emitters using two different versions of the 
GALFORM galaxy formation model. At z = 0.84 and for the flux 
limit of S09, Orsi et al. (2009) predicts r « 4 - 5h -1 Mpc. This 
shows that the disagreement between the data and the models found 
at z = 0.24 and z = 0.4 in Orsi et al. (2009) is also found at 
z — 0.84. The models predictions would be consistent with the 
data for flux limits 2 times higher due to the strong clustering de- 
pendence with Ha luminosity at z = 0.84; however, none of the 
models is able to fully reproduce this r -L,Hc« relation at the mo- 
ment. 



Table 2. The correlation length (fixing 7 = —1.8 and corrected by 5 per 
cent for contamination) for Ha emitters at 2 = 0.84. Samples with fixed 
M K have -24.44 < M K < -22.68 (AB) and those with fixed logL Hc « 
have 41.76 < logL Ha < 42.22 (ergs -1 ). Ha luminosities are given in 
erg s — 1 , Mk and M B are AB rest-frame luminosities. Unclassified sources 
or with no data available were not considered. Errors include cosmic vari- 
ance (1 1 per cent of ro for the entire sample and 14 per cent of ro for samples 
of each individual field). 
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4.2 Rest-frame continuum luminosity 

Ha emitters presenting the highest rest-frame K band luminosi- 
ties {Mk < —24) are strongly clustered, and a general trend of 
an increasing ro with K band luminosity is found (see left panel of 
Figure 7), similarly to what has been found for other populations 
of galaxies at different redshifts. This suggests that this correlation 
must be valid regardless of the population being observed - at least 
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Figure 7. The real-space correlation length as a function of rest-frame K luminosity (left panel), rest-frame B luminosity (middle panel) and visual morphology 
(right panel). These results show that star-forming galaxies cluster more with increasing rest-frame K luminosity (which traces stellar mass), with this trend 
being maintained (but slightly weakened) even when one uses sub-samples that have the same Ho luminosity distribution. No statistically significant correlation 
between ro with rest-frame B luminosity is found. Morphologically classified irregulars and mergers cluster more strongly than discs and non-mergers, but 
this is due to a different Ha luminosity and M K distribution within each population; matching these distributions results in a similar tq. 



for the range of luminosities probed - and seems to simply imply 
that galaxies with the highest stellar masses (as these are expected 
to correlate very closely with K luminosity, although the contribu- 
tion from the thermally pulsing asymptotic giant branch (TP-AGB) 
phase of stellar evolution can lead to some scatter in the Mk versus 
stellar mass relation; c.f. Maraston et al. 2006) reside in the most 
massive haloes. However, it is also clear that the increase of ro with 
rest-frame K luminosity, whilst continuous, is not as pronounced 
as the increase with star-formation rate. 

In the local Universe, galaxies with higher B band luminosi- 
ties (Mb) are more clustered than galaxies with lower Mb - How- 
ever, studying the clustering properties of Ha emitters at z — 0.845 
as a function of rest-frame B luminosity reveals no statistically sig- 
nificant trend (< la) (see middle panel of Figure 7). These results 
are in reasonable agreement with recent studies (e.g. Meneux et al. 
2009), which did not find any significant dependence of galaxy 
clustering on B luminosity for a similar redshift range. 



4.3 Morphological class 

By splitting the sample into morphological classes (only for the 
COSMOS field [c.f. S09]), one finds that galaxies classified as ir- 
regulars are more clustered than those classified as discs; mergers 
have a measured ro which lies between these populations, but sig- 
nificantly above the non-mergers (see Table 2 and right panel of 
Figure 7 - open circles). However, S09 found that irregulars and 
mergers are typically brighter in Ha and Mk than discs (which 
completely dominate the faint-end of the Ha luminosity function 
at z ~ 1). Therefore, in order to understand if the clustering re- 
ally does depend on the morphological class or if this is simply 
a secondary effect driven by star-formation rate and stellar mass 
dependencies, one needs to compare samples which are matched 
in logLHo, and Mk- This is done by using the distribution of Ha 
emitters in the log Lh q -Mk plane and matching irregulars with 
discs and mergers with non-mergers. A A log Lhq, < 0.02 & 
AMk < 0.02 criteria is used: these values were chosen to ensure 
that the samples are very well-matched in both Ha luminosity and 
Mk while still retaining the sample sizes required for the analysis. 
The population match used still results in a severe reduction of the 
larger disc and non-merger samples (along with a smaller reduction 
of the number of irregulars and mergers), but by matching these on 
the basis of Mk and log Lh q distribution, it is then possible to di- 



rectly compare these populations on the basis of the morphological 
class only. 

There is no significant difference in ro for the matched sam- 
ples of irregulars and discs (see Figure 7 - filled circles) within la. 
The ro difference between mergers and non-mergers is also greatly 
reduced, although mergers are still found to be slightly more clus- 
tered than non-mergers at ~ 1 a level. 

The sample of Ha emitters with disc morphologies has also 
been classified according to how much disc/bulge dominated each 
galaxy is (with ZEST; c.f. Scarlata et al. 2007). By computing the 
correlation length for disc dominated discs and bulge dominated 
discs, one finds that they present reasonably the same ro (2.7 ± 
0.5 h _1 Mpc for disc dominated and 2.5 ± 0.4 h _1 Mpc for bulge 
dominated galaxies). Therefore, the bulge-disk ratio of the studied 
star-forming disc galaxies does not have any significant effect on 
how these galaxies cluster at z ~ 1. 

These results show that Ha luminosity and Mk (probing stel- 
lar mass) are the key host galaxy properties driving the clustering 
of star-forming galaxies at z ~ 1 and morphology is unimportant 
for the clustering of galaxies at high redshift, just as has been found 
in the local Universe (e.g. Skibba et al. 2009). 



4.4 Ha luminosity versus rest-frame K luminosity 

It has been shown that ro is very well correlated with both Ha lumi- 
nosity (or star-formation rate) and rest-frame K luminosity (trac- 
ing stellar mass). On the other hand, star-forming galaxies both in 
the local Universe and at high redshift reveal a correlation between 
both, with star-formation rate typically being higher for galaxies 
with higher stellar masses (see Figure 8). Thus, how much is the 
correlation between r and Ha luminosity driven by different rest- 
frame K luminosity distributions, and vice-versa? 

In order to clearly test if both relations hold - or whether 
they are a result of a single strong relation between ro and either 
Ha luminosity of rest-frame K luminosity - new sub-samples are 
derived. To do this, two sub-regions are defined in the 2-D Lho- 
Mk space (see Figure 8). To test the to-Lh^ correlation at a fixed 
M K , the region -24.44 < M K < -22.68 (AB) and 41.71 < 
logL Ha < 42.44 ergs -1 (corresponding to 4.1 < SFR < 21.8 
M Q yr _1 ) is considered (see Figure 8). This region contains 409 
Ha emitters, which can be divided into 4 sub-samples on the basis 
of Ha luminosities with the same distribution of Mk (means of 
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Figure 8. The complete 2-D Mx-logLjja space, showing the correlation 
between rest-frame K luminosity and Ha luminosity, compared to the re- 
gions used to select sub-samples with the same Mk or Ljja distributions. 
This clearly shows that without using sub-regions of this 2-D space, sam- 
ples defined only on either Ljj a or Mk will present different Mk and 
L Ha distributions, respectively, since these quantities are correlated. 



-23.6, -23.7, -23.6 and -23.7 with standard deviations of 0.5 
in all instances; ordered from highest to lowest in respect to Ha 
luminosity). Similarly, in order to test the xq-Mk correlation, the 
sample is restricted to Ha emitters within the region defined by 
-24.96 < M K < -21.81 (AB) and 41.76 < logL H c« < 42.22 
ergs -1 (see Figure 8) in which sub-samples split on the basis of 
their Mk have the same Ln a distributions (means of 41.97, 41.95, 
41.95 and 41.97 and standard deviations of 0.12 in all instances; 
ordered from highest to lowest in Mk) - c.f. Table 2. The angular 
correlation functions of these matched sub-samples are then com- 
puted and values of ro derived as fully described before. 

The results are presented in Table 2 and in Figures 6 and 7. 
These reveal that both Ha luminosity and rest-frame K luminos- 
ity are relevant for the clustering of star-forming galaxies, since 
for a fixed distribution of one of these, a variation in the other one 
leads to a change in ro. However, it should be noted that the corre- 
lation between ro and Ha luminosity (for a fixed Mk distribution) 
is maintained as highly statistically significant, while the statistical 
significance of the correlation between r and M K at a fixed L Ha 
distribution is slightly reduced. 



Shioya et al. (2008) used galaxy colours to identify ~ 1000 
candidate z = 0.24 Ha emitters within their narrow-band ex- 
cess sample. For this analysis, the sample is restricted to 492 
sources by limiting it to a very robust completeness limit (Lh« > 
10 39 ' 8 ergs~\ corresponding to SFR > O.O5M yr _1 ). The ro- 
bustness of this sample can be tested and improved taking advan- 
tage of a large set of spectroscopic data recently made available 
in the COSMOS field by the zCOSMOS project. There are 75 Ha 
candidates with a spectroscopic redshift available from zCOSMOS 
and 69 are identified as Ha emitters at 0.236 < z < 0.252. 
The remaining 6 sources are identified as [Sll] 6717 emitters at 
0.229 < z < 0.231 (contamination by Sll emitters is even more 
significant at lower line fluxes, as pointed out recently by Westra 
et al. 2009). With the robust luminosity cut used here, this corre- 
sponds to a contamination of 8 per cent, and by removing the 6 
confirmed contaminants from the sample of Ha emitters, the con- 
tamination is estimated to be 7 per cent for the remaining sample 
of 486 emitters. This results in underestimating ro by a maximum 
of 8.4 per cent and ro will be corrected by 7 per cent to account 
for contamination in the sample at z — 0.24; this is 80 per cent 
of the maximum correction, consistent with the approach used for 
z = 0.84. 

Finally, in order to obtain ro (following the same procedures 
as for the z — 0.84 sample, except AGN contamination), two red- 
shift distributions are considered: the one assumed in Shioya et al. 
(2008; a top-hat characterized by z = 0.242 ± 0.009), and a red- 
shift distribution derived from the distribution of the 69 spectro- 
scopically confirmed Ha emitters, which can be well-described by 
a Gaussian with an average of z = 0.245 and a standard deviation 
of a = 0.006. In practice, the values derived from both distribu- 
tions agree well, but for consistency in determining ro, the Gaussian 
distribution will be assumed. 

The correlation length results in r = 1.8 ±0.2 h _1 Mpc, with 
the error including random errors (from the standard deviation ob- 
tained after 1000 measurements of ui(6)) and cosmic variance (as- 
sumed to affect r by 1 1 per cent for this area). Only separations of 
5 < 6 < 800 arcsec are used when obtaining a power-law fit: an- 
gular separations lower than 5 arcsec are not used as ui(6) becomes 
steeper, deviating from the power law; this is simply interpreted as 
the transition between the two-halo and one-halo clustering, hap- 
pening at ~ 20 kpc. It should also be noted that the Limber equa- 
tion works well for this case, with the differences between using 
such approximation and integrating the full relation between uj(9) 
and £ (r) being lower than 5 per cent for 6 < 800 arcsec. 



5 THE CLUSTERING OF Ha EMITTERS ACROSS 
COSMIC TIME 

The clustering properties of the Ha emitters at z = 0.84 can be 
compared with those of the HiZELS sample at z — 2.23 (G08) 
and with results derived from lower redshift surveys of Ha emitters 
at z = 0.24 and z = 0.4 (Shioya et al. 2008; Nakajima et al. 
2008). This can then be interpreted in the context of the ACDM 
cosmological model. 

5.1 The clustering of Ha emitters at z = 0.24 

With the motivation of reliably comparing the clustering measure- 
ments at different redshifts, and since the Shioya et al. (2008) sam- 
ple is publicly available, ui(9) is carefully re-computed for this 
sample. 



5.2 The clustering of Ha emitters at z = 2.23 

It has been mentioned that Limber's equation breaks down for large 
angular separations and for very narrow filters. Indeed, atz — 2.23 
the co-moving space width of the filter is extremely narrow, pre- 
senting a = 7.3 h -1 Mpc, and representing only « 0.2 per cent 
of the total co-moving distance to z = 2.23; this means that if 
the spatial correlation function of the Ha emitters is a power-law 
£ = (r/ro) -1 ' 8 , then u>(0), measured at 5 < 6 < 1000 arcsec can 
not be well-fitted by a power-law with /3 = —0.8, since this regime 
is probing the exact change between j3 = 7 + 1 and = 7. 

The correlation length ro is therefore re-computed. This is 
done by using the u{6) data points presented in G08 and by do- 
ing a x 2 fit t° me obtained u)(9) from Equation 3. This assumes 
that the spatial correlation function is a power-law with 7 = —1.8 
(which is shown to reproduce the data very well) and only ro is al- 
lowed to vary. This procedure results in r = 2.6 ± 0.5 h -1 Mpc, 



The clustering and evolution of Ha emitters at z ~ 1 from HiZELS 1 1 



4.5 
4.0 
3.5 



o 3.0 
3-2.5 



2.0 
1.5 
1.0 



This work . ■ 

SFR > l.OMi.yc- 1 ' — 
.Re-computed G08 
Nakajima et al. 2008 
Re-computed Shioya et al. 



. . DM Halos M udn « 10 13 M K 
_ DM Halos Mnm a 10 12 M G 
- DM Halos M mi „ m 10 n M E 




0.25 0.4 0.65 1.0 
Redshift {z) 



1.6 2.5 



13.5 



13 



05 

w 12.5 
E 

o 12 

05 
X 

.a 11.5 



a 11 

o 

— 10.5 



Ha z = 0.24 
Ha z = 0.84 
Ha z = 2.23 



+ 



40.0 40.5 41.0 41.5 42.0 42.5 43.0 43.5 
L Hq (ergs- 1 ) 



Figure 9. Left: The clustering length (ro) as a function of redshift for Ha emitters selected by narrow-band surveys. This reveals that the Ha emitters at 
z = 0.84 studied by HiZELS reside in typical dark matter haloes of M m ; n Rj 10 12 Mq, consistent with being the progenitors of Milky-Way type galaxies. 
The lower luminosity Ha emitters found in smaller volumes at z = 0.24 and z = 0.4 reside in less massive haloes; however, the most luminous Ha emitters 
at z = 0.24 cluster more strongly, and seem to reside in M m ; n S3 10 12 Mq, while Ha emitters at z = 2.23 reside in haloes just slightly less massive. Right: 
The minimum mass of host dark matter haloes as a function of Ha luminosity; this reveals that more luminous emitters reside in more massive haloes, at any 
given cosmic time, but it also shows that the relation between halo mass and Ha luminosity/SFR evolves across cosmic time, with M m ; n R3 10 12 Mq being 
much more effective (in respect to SFRs) at z ~ 2 than at z < 1. 



with the error being derived directly from the \ 2 fit. Note that this 
implies a best fit u)(6) with j3 w —1 (for 7 = —1.8) over the 
separations studied, agreeing well with the G08 best f3 fit to lj(8) 
of p — — 1. The value of ro is notably lower than the value of 
ro = 3.6 ± 0.4 h _1 Mpc derived by G08 from using the Limber ap- 
proximation and their fitted values for A and /3. A further correction 
is applied to account for a 15 per cent known contamination (the 
confirmed contaminants are all at different redshifts; this results in 
underestimating ro by 20 per cent) based on limited spectroscopic 
data available at the moment (J. Geach et al. in prep.); this finally 
results in ro = 3.1 ± 0.7h _1 Mpc (this also assumes a 14 per cent 
error in ro due to cosmic variance based on the area of the survey, 
but such uncertainty is likely to be under-estimated). 

5.3 The clustering evolution of Ha emitters since z = 2.23 

Figure 9 presents ro as a function of redshift for Ha emitters; this 
is the first combination of self-consistent clustering measurements 
for Ha emitters spanning more than half of the history of the Uni- 
verse (~ 8 Gyrs) whilst probing 4 different well-defined epochs. 
For comparison, the io(z) predictions for dark matter haloes with 
a fixed minimum mass of M m i n > 10 11-13 Mq (Matarrese et al. 
1997; Moscardini et al. 1998) are also shown. A ACDM cosmol- 
ogy is assumed, together with an evolving bias model b(z) from 
Moscardini et al. (1998) and the values from that study are used for 
various fixed minimum mass haloes (c.f. G08). 

The results show that Ha emitters found in the HiZELS survey 
at z = 0.84 (SFRs > 3M Q yr _1 ) reside in Milky-Way type haloes 
of M m i n ~ 10 12 Mq - the typical haloes where L* galaxies in the 
local Universe reside. This contrasts with the low-Ha-luminosity 
galaxies for the samples at z = 0.24 and z = 0.4 (presenting 
SFRs > 0.05 Mq yr _1 ). These seem to reside in much less massive 
haloes (M m i n ~ 10 1 Mq). These low redshift emitters also present 
very low stellar masses and low luminosities in all available bands 
and are therefore likely to be small, young, dwarf-like galaxies, 
very different from the already much more massive and active star- 
forming galaxies found at z = 0.84. 



Nevertheless, motivated by the correlation between ro and 
Lhc found both at z — 0.24 and z — 0.84, it is found that if one 
considers only the « 5 per cent most luminous emitters (in Ha) 
at z — 0.24 (a rough match in Lh q to the z = 0.84 sample), then 
these are much more clustered than the complete sample, present- 
ing ro ~ 2.4h _1 Mpc; this is consistent with these emitters resid- 
ing in dark matter haloes of M mm ~ 10 Mq. These emitters also 
present stellar masses closer to those presented by the z = 0.84 
Ha-selected population, suggesting that the brightest z — 0.24 Ha 
emitters and part of the sample at z — 0.84 might be related. 

At higher redshift (z — 2.23), one finds Ha emitters residing 
in the dark matter haloes around M m in ~ 10 12 Mq, and it is there- 
fore possible that at least a fraction of the very actively star-forming 
Ha emitters found at z = 2.23 (SFRs > 70 M Q yr" 1 ) by G08 with 
HiZELS will turn into the typical Ha emitters at z = 0.84 with a 
strong L^ a decrease. 



5.4 The dark matter host halo-L^ relation 

The right panel of Figure 9 presents how the minimum mass of the 
host dark matter halo changes with measured Ha luminosity (all lu- 
minosities are derived assuming a constant 1 mag of extinction as in 
S09). This shows that while the host halo mass increases with lumi- 
nosity at any given redshift, there seems to be a different relation for 
each redshift/epoch. For a given dark matter halo mass, one finds 
that star-formation is tremendously more effective at high-redshift 
than at lower-redshit; this difference is especially pronounced from 
z = 0.84 to z = 2.23. On the other hand, G08 and S09 demon- 
strated that there is a clear evolution in the Ha luminosity function, 
showing that the characteristic luminosity, L^ lct , evolves by a factor 
of > 20 from the local Universe to z = 2.23; the Ljfj a evolution is 
also most pronounced from z = 0.84 to z = 2.23. This suggests 
that there could be a relation between the host dark matter halo and 
Lh q found at different epochs. 

In order to investigate this, the results from the right panel of 
Figure 9 are shown in Figure 10 after scaling the measured lumi- 



12 D. Sobral et al. 



13.5 



13 



52 12.5 

CO 



O 12 

CO 

X 

.5 11.5 



a 11 

o 

_ 10.5 



• Ha z = 0.24 

• Ha z = 0.84 
■ Ha z = 2.23 



#4 



0.01 0.03 



0.1 0.3 

Lho/LhJz) 



1.0 



3.0 



Figure 10. The minimum mass of host dark matter haloes as a function of 
Lho^ho ( 2 ) (''[i ( z ) i s tne break in the Ha luminosity function, shown 
to evolve greatly from the local Universe up to z = 2.23 - see S09 for 
more details). This reveals a good agreement between the data at 3 very 
distinct epochs of the history of the Universe. This shows that luminous Ha 
emitters reside in more massive haloes, but also that across 8 billion years, 
emitters at the same fraction of L*^ a seem to reside in haloes of typically 
the same mass. 

nosities by L^ a found at each individual redshift 7 . Remarkably, 
this scaling reveals a clear relation between the typical dark mat- 
ter halo host of Ha emitters and the value of hua/L^, remov- 
ing essentially all of the evolution seen in Figure 9. Even though 
there is a significant evolution in the Ha emitters population from 
z — 2.23 to the present epoch, at L^ a , Ha emitters seem to re- 
side in M m i n ~ 10 13 M© dark matter haloes, independently of cos- 
mic time. Galaxies below the luminosity function break reside in 
least massive haloes, while (at least for z = 0.84), Ha emitters 
above L^ a seem to reside in haloes just slightly more massive than 
1O 13 M . 

As these results suggest an epoch-independent constancy be- 
tween L^a an d the minimum mass of the dark matter halo host of 
Ha emitters, it seems plausible to suggest a simple connection be- 
tween the two properties. Indeed, it is possible that the strong evo- 
lution in the break of the Ha luminosity function is being driven 
by the quenching of star-formation for galaxies residing in haloes 
much more massive than M w 10 12 Mq , either because such haloes 
favour very intense and fast bursts of star-formation, capable of us- 
ing the majority of the gas, or because such halo masses create the 
necessary conditions for AGN feedback to become important. 



6 CONCLUSIONS 

The clustering properties of Ha emitters at z = 0.84 have been 
fully detailed. These emitters are moderately clustered, with their 
angular correlation function u>(6) being very well fitted by a power 
law A8~P with A = 15.5 ±4.1 (for 8 in arcsec) and /3 = 



-0.81 ± 0.05 out to 600 arcsec (4.5 Mpc). The exact relation be- 
tween the spatial and angular correlation funtion is used to show 
that the Ha emitters at z = 0.84 present r = 2.7 ± 0.3 h" 1 Mpc 
for a spatial correlation function given by £ = (r/ro)~ 1,8 , with the 
errors accounting for cosmic variance. 

A strong dependence of the correlation length on Ha luminos- 
ity is found, with the most actively star-forming galaxies presenting 
5.2 ± 1.0 h _1 Mpc, while the lower Ha luminosity galaxies present 
a spatial correlation as low as 2.1 ± 0.6 h _1 Mpc. The correlation 
length also depends on rest-frame K luminosity (broadly tracing 
stellar mass) for star-forming galaxies at z ~ 1 but not on rest- 
frame B luminosity (or only very weakly). 

The ro correlation with Lh q is fully maintained at a fixed rest- 
frame K luminosity, clearly revealing that it is not a simple result of 
the Lhq-Mk correlation; the tq-Mk correlation is also maintained 
at a fixed Lh« distribution, but slightly weakened. 

Irregular galaxies and mergers are found to cluster more 
strongly than discs and non-mergers, respectively. This, however, 
seems to be driven by the different Ha luminosity and Mk dis- 
tributions of the distinct populations; once they are matched on the 
two properties, irregulars and discs present the same ro and mergers 
and non-mergers are consistent within ~ la. Bulge-to-disc fraction 
is also shown not to be important to the clustering of star-forming 
galaxies at z ~ 1, and morphology seems to be as unimportant for 
the clustering at high redshift as it is in the local Universe. 

Ha emitters at z = 0.84 found by HiZELS reside in minimum 
dark matter haloes of « 10 12 Mq similar to those of Milky Way 
type galaxies in the local Universe. Those are roughly the same 
as the haloes hosting the brightest Ha emitters at z = 0.24, and 
just slightly higher mass haloes than the hosts of Ha emitters at 
z = 2.23. Furthermore, the minimum dark matter halo mass host- 
ing Ha emitters increases with Ha luminosity, and a L^ a scaling 
is able to combine observational results probing the last 8 billion 
years of the age of the Universe in one single relation. This also 
suggests a connection between the strong L^ a evolution of the Ha 
luminosity function and star-formation being truncated in galaxies 
residing within dark matter haloes with masses much higher than 
« 1O 12 M . 

The results presented in this study provide important details 
on the clustering and evolution of Ha-selected star-forming galax- 
ies at z = 0.84, which are mostly discs, but with a significant pop- 
ulation of irregular and mergers above L^ a . Besides identifying 
and quantifying clustering relations with fundamental galaxy ob- 
servables at z ~ 1 for the first time, the results clearly show that 
the highest star-formation rates at any epoch only occur in galaxies 
residing within massive haloes. Dark matter haloes of a given mass 
seem to be more effective at providing the conditions for intense 
star-bursts at higher redshift. On the other hand, the fact that the 
brightest Ha emitters become rarer with cosmic time down to the 
present (seen by the strong evolution in the Ha luminosity func- 
tion; e.g. G08, S09) suggests that the most massive haloes not only 
provide the conditions and the environment required for the highest 
SFRs to take place, but they also seem to be the sites in which the 
quenching of star-formation in galaxies occurs across cosmic time. 



7 The best-fit values for LJj are taken from Shioya et al. (2008) at z = 
0.24, from S09 at z = 0.84, and from the recent determination of Hayes 
et al. (2009) at z = 2.23. The Hayes et al. measurement combines the 
results of G08 with deeper HAWK-I observations, and suggests a ~ 0.2 dex 
higher value of Lf, than G08 derived, but within the errors of that study. 
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